Decision-Theoretic Planning

نویسنده

  • Jim Blythe
چکیده

ions and hierarchical approaches While the envelope extension method ignores portions of the state space, other techniques have considered abstractions of the state space that try to group together sets of states that behave similarly under the chosen actions of the optimal policy. Boutilier and Dearden (Boutilier & Dearden 1994) assume a representation for actions that is similar to that of Buridan (Kushmerick, Hanks, & Weld 1994) described earlier and a state utility function that is described in terms of domain literals. They then pick a subset of the literals that accounts for the greatest variation in the state utility and use the action representation to find literals that can directly or indirectly affect the chosen set, using a technique similar to the one developed by Knoblock for building abstraction hierarchies for classical planners (Knoblock 1991). This subset of literals then forms the basis for an abstract MDP by projection of the original states. Since the state space size is exponential in the set of literals, this reduction can lead to considerable time savings over the original MDP. Boutilier and Dearden prove bounds on the difference in value of the abstract policy compared with an optimal policy in the original MDP. Dean and Lin refine this idea by splitting the MDP into subsets and allowing a different abstraction of the states to be considered in each one (Dean & Lin 1995). This approach can provide a better approximation because typically different literals may be relevant in different parts of the state space. However there is an added cost to re-combining the separate pieces unless they happen to decompose very cleanly. Dean and Lin assume the partition of the state space is given by some external oracle. Factored action and state representations While the last approaches exploited structure in the state utility description it is also possible to exploit it in the action description. Boutilier et al. extend modified policy iteration to propose a technique called structured policy iteration that makes use of a structured action representation in the form of 2-stage Bayesian networks (Boutilier, Dearden, & Goldszmidt 1995). The representation of the policy and utility functions are also structured, using decision trees. In standard policy iteration, the value of the candidate policy is computed on each iteration by solving a system of jSj linear equations (step 2 in Table 2), which is computationally prohibitive for large real-world planning problems. Modified policy iteration replaces this step with an iterative approximation of

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Decision-Theoretic Planning Approach to Synthesizing High-Quality Contract Anytime Algorithms

In this paper, we develop a decision-theoretic planning approach to synthesizing high-quality contract anytime algorithms. Anytime algorithms are considered as time-parameterized probabilistic actions. Given a set of primitive anytime algorithms and their performance prooles, the problem of synthesizing high-quality anytime algorithms is translated to a decision-theoretic planning problem.

متن کامل

Using Decision-Theoretic Planning Agents in Market-Based Systems

This paper examines a number of theoretical and practical issues concerning the use of decision-theoretic planning to implement agents in market-based systems. The markets considered here result from the decomposition of complex, intra-organizationai resource allocation problems such as manufacturing scheduling. Although these problems can be formulated as monolithic optimization problems, they...

متن کامل

cient Decision - Theoretic Planning : Techniques and EmpiricalAnalysisPeter

This paper discusses techniques for performing eecient decision-theoretic planning. We give an overview of the drips decision-theoretic reenement planning system, which uses abstraction to eeciently identify optimal plans. We present techniques for automatically generating search control information , which can signiicantly improve the planner's performance. We evaluate the ee-ciency of drips b...

متن کامل

Decision-theoretic Refinement Medical Decision Making: Planning in Management of Acute Deep Venous Thrombosis

Decision-theoretic refinement planning is a new technique for finding optimal courses of action. The authors sought to determine whether this technique could identify optimal strategies for medical diagnosis and therapy. An existing model of acute deep venous thrombosis of the lower extremities was encoded for analysis by the decision-theoretic refinement planning system (DRIPS). The encoding r...

متن کامل

Decision-Theoretic Subgoaling for Planning with External Events

I describe a planning methodology for domains with uncertainty in the form of external events that are not completely predictable. Under certain conditions, these events can be modelled as continuous-time Markov chains whose states are characterised by the planner’s domain predicates. Planning is goal-directed, but the subgoals are suggested by analysing the utility of the partial plan rather t...

متن کامل

Against Optimality: Logical Foundations for Decision-Theoretic Planning in Autonomous Agents

This paper investigates decision-theoretic planning in sophisticated autonomous agents operating in environments of real-world complexity. An example might be a planetary rover exploring a largely unknown planet. It is argued th a t existing algorithms for decision-theoretic planning are based on a logically incorrect theory of rational decision making. Plans cannot be evaluated directly in ter...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • AI Magazine

دوره 20  شماره 

صفحات  -

تاریخ انتشار 1999